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Abstract 

The model of low-dimensional manifold and sparse representation are two well-known concise models that suggest 
each data can be described by a few characteristics. Manifold learning is usually investigated for dimension reduction 
by preserving some expected local geometric structures from original space to low-dimensional space. The structures 
are generally determined by using pairwise distance, e.g., Euclidean distance. Alternatively, sparse representation 
denotes a data point as a linear combination of the points from the same subspace. In practical applications, however, 
the nearby points in terms of pairwise distance, may not belong to the same subspace, and vice versa. Consequently, 
it is interesting and important to explore how to get a better representation by integrating these two models together. 
To this end, this paper proposes a novel coding algorithm, called Locality-Constrained Collaborative Representation 
(LCCR), which improves the robustness and discrimination of data representation by incorporating the locality based 
on pairwise distance into coding process. In addition, the proposed objective function has an analytical solution, and 
it does not involve local minima. The empirical studies based on three public facial databases, ORL, AR and Extend 
Yale B, demonstrate that LCCR outperforms three state-of-the-art models, sparse representation based classification 
(SRC) [1], ^^-FR [2] and CRC-RLS [3] in the context of recognizing human faces from frontal views with varying 
expression and illumination, as well as various corruptions and occlusions. 

Keywords: Non-sparse representation. Sparse representation, locality preservation, ^^-Minimization, 
-Minimization . 



1. Introduction 

Sparse coding or sparse representation has become a powerful method to address problems in machine intelli- 
gence [4, 5]. It assumes that each data point x g R"^ can be encoded as a linear combination of other points, i.e. 
X = Da, where D is a dictionary consisting of other points, and a is the representation of x over D. If most entries of 
a are zeros, then a is called a sparse representation. Generally, it can be achieved by solving 

(Pq) : min||a||o s.t. x = Da, 

where || • ||o denotes ^^-norm which counts the number of nonzero entries in a vector. Pq is diflScult to solve since it is 
a NP-hard problem. Recently, compressive sensing theory [6, 7, 8] have found that the solution of Pq equals that of 
^^-minimization problem, i.e., 

(Pij) : min||a||i s.t. x = Da, 

where the ^^-norm || • ||i sums the absolute value of all elements in a vector. Pij is convex and can be solved by a 
large amount of convex optimization methods, such as basis pursuit (BP) [9], least angle regression (LARS) [10]. In 
[11], Yang et al. make a comprehensive survey for some popular optimizers. 
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Benefiting from the emergence of compressed sensing theory, sparse coding has been widely used for various 
recognition tasks. Wright et al. [1] reported a remarkable method that uses sparse representation for face recognition, 
named SRC. They first encode a given image to a linear combination of training face images via ^^-minimization. Then 
they perform classification by finding which class produces the minimal reconstruction error (also called residual). 
SRC showed impressive accuracy and robustness, and has motivated a large amount of works in pattern recognition 
and machine learning, e.g., subspace learning [12, 13, 14], spectral clustering [15, 16], image classification [17], 
matrix factorization [18]. 

However, is ^^-norm based sparsity really necessary to improve the performance of face recognition? Four papers 
recently examined this problem. Yang et al. [19] discussed the connections and diff'erences between -optimizer 
and ^^-optimizer for SRC. Their theoretical analysis and experimental studies show that the success of SRC should at- 
tributes to the mechanism of -optimizer which selects the set of support training samples for the given testing sample 
by minimizing reconstruction error. In other words, the support training samples guarantee the global similarity but 
sparsity which is critical for pattern recognition. Rigamonti et al. [20] compared the discrimination of two diff'erent 
data models. One is the ^^-norm based sparse representation, and the other model is produced by passing input into 
a simple convolution filter. Their result showed that two approaches achieve a similar recognition rate. Therefore, 
^^-norm based sparsity is actually not as essential as it seems in the previous claims. Shi et al. [2] provided a more 
intuitive approach to investigate this problem by removing the ^^-regularization term from the objective function of 
SRC. Their experimental results showed that their method (^^-FR) achieves a higher recognition rate than SRC if the 
original data is available for ^^-FR. Moreover, Zhang et al. [3] replace the ^^-norm by the ^^-norm, and their experi- 
mental results again support the views that ^^-norm based sparsity is not necessary to improve the discrimination of 
data representation. 

As another extensively- studied concise model, manifold learning is usually investigated for dimension reduction 
by embedding local geometric structure of original data into a low-dimensional representation [21, 22, 23, 24, 25]. The 
geometric structure is generally determined by using pairwise distance, which is hardly reflected in sparse represen- 
tation. As the neighboring points in terms of pairwise distance may not belong to the same subspace (sub-manifold), 
and vice versa, the data models based on either locality or sparsity cannot represent well the real structure of data. 

To address this problem, some researchers have explored the possibility of integrating the locality with the sparsity 
together to produce a better data model. Richard et al. [26] successfully bridged the connections between sparse coding 
and manifold learning, and have founded the theory for random projections of smooth manifold; Majumdar et al. [27] 
investigated the eff'ectiveness and robustness of random projection method in classification task; Peng et al. [28] 
proposed a method to construct a similarity graph by enforcing locality onto a non-sparse graph. These works tried 
to introduce sparsity into the framework of graph-oriented algorithms, have attracted a lot of attentions and achieved 
state-of-the-art results in subspace learning and segmentation tasks. 

On the other hand, few works investigated the possibility of assimilating the geometric structure of data into the 
process of data modeling. To the best of our knowledge, the most similar works are: Wang et al. [29] proposed a 
dictionary learning method named locality-constrained linear coding (LLC) for hierarchal images classification; Chao 
et al. [30] presented a approach to unify group sparsity and data locality by introducing the term of ridge regression 
into LLC; Yang et al. [31] incorporated the prior knowledge into the coding process by iteratively learning a weight 
matrix of which the atoms measure the similarity between two data points. 

Motivated by the assumption that similar inputs have similar codes, we propose a coding algorithm for face recog- 
nition, named Locality-Constrained Collaborative Representation (LCCR). Here, collaborative representation means 
that the testing sample is encoded in terms of its intra-subspace and neighboring data points. LCCR incorporates 
the locality based on pairwise distance into the global representation over the training data for enforcing the the 
representation of input could reconstruct the input and its nearby points with the minimum reconstruction error simul- 
taneously. Note that, LCCR has an analytic solution and does not involve an alternative optimization process. The 
experimental results show that LCCR outperforms SRC, ^^-FR, CRC-RLS, Linear SVM on original input in accuracy 
and robustness in the context of face recognition. 

The remainder of paper is organized as follows: Section 2 introduces three state-of-the-art works for face recogni- 
tion based on data representation, i.e., SRC [1], ^^-FR [2] and CRC-RLS [3]. Section 3 presents our LCCR algorithm. 
Section 4 reports the experiments on facial databases. Finally, Section 5 contains the conclusion. 
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2. Preliminaries 



2.1. Notation 

We consider a set of N facial images collected from L subjects. Each training image, which is denoted as a vector 
d/ € R^, corresponds to the /th column of a dictionary D e M^^^. Without generality, we assume that the columns of 
D are sorted according to their labels. 

Except in some specified cases, lower-case bold letters represent column vectors and upper-case bold ones repre- 
sent matrices, denotes the transpose of the matrix A, A~^ represents the pseudo-inverse of A, and I is reserved for 
identity matrix. 

2.2. Sparse representation based classification 

Sparse coding aims at finding the most sparse solution of Pi j . However, in many practical problems, the constraint 
X = Da cannot hold exactly since the input x may include noise. Wright et al. [1] relaxed the constraint to ||x - Dalb < 
e, where e > is the error tolerance, then. Pi i is rewritten as: 

(Pi,2) : min||a||i s.t. ||x - Dalb < s. 

Using Lagrangian method. Pi, 2 can be transformed to the following unconstrained optimization problem: 

(Pi,3): argmin||x-Da||^ + ^||a||i, 

a 

where the scalar /I > balances the importance between the reconstruction error of x and the sparsity of code a. Given 
a testing sample x e R^, its sparse representation a* e R^ can be computed by solving Pi,2 or P13. 

After getting the sparse representation of x, one infers its label by assigning x to the class that has the minimum 
residual: 

rKx) = ||x-D.^,-(a*)||2, (1) 
identity(x) = argmin{r/(x)}. (2) 

i 

where the nonzero entries of ^K^*) ^ the entries in a* that are associated with /th class, and identity(x) denotes 

the label for x. 

2.3. -minimization based representation 

fi-¥K [2] and CRC-RLS [3] are two recently-proposed ^^-norm based coding algorithms which achieve state-of- 
the-art accuracy and computational efficiency in face recognition. ^^-FR removes the -regularization term from Pi,4, 
then the objective function becomes: 

(^^-FR) : argmin||x-Da||^, 

a 

Moreover, a key diff'erence between ^^-FR and SRC is that ^^-FR requires D to be an over-determined matrix for 
achieving competitive results, while the dictionary D of SRC must be under-determined to satisfy the conditions of 
compressive sensing. Once the optimal code a* is calculated for a given input, the classifier (1) and (2) is used to 
determine the label for the input x. 

CRC-RLS estimates the representation a* for the input x by relaxing the ^^-norm to the ^^-norm in Pi,3. They 
aimed to solve following objective function: 

(CRC-RLS) : argmin ||x - Da||^ + ^||a||^, 

a 

where /I > is a balance factor. 

^^-FR and CRC-RLS show that ^^-norm based data models can achieve competitive classification accuracy with 
hundreds of times speed increase, compared with SRC. Under this background, we aim to incorporate the local 
geometric structures into coding process for achieving better discrimination and robustness. 



3 



3. Locality-Constrained Collaborative Representation 

It is a big challenge to improve the discrimination and the robustness of facial representation because a practical 
face recognition system requires not only a high recognition rate but also the robustness against various noise and 
occlusions. As two of the most promising methods, locality preservation based algorithm and sparse representation 
have been extensively studied and successfully applied to appearance-based face recognition, respectively. Locality 
preservation based algorithm aims to find a low-dimensional model by preserving local neighborhood from original 
space to another one. It is derived from the assumption of manifold learning that the neighborhood of each point is 
homeomorphic to the Euclidean space if the data is well sampled from a smooth manifold. Alternatively, sparse 
representation, which encodes each testing sample as a linear combination of the training data, depicts a global 
relationship between testing sample with training ones. In this paper, we aim to integrate the locality based on pairwise 
distance with the global property derived from coding scheme together for modeling facial data. Our objective function 
is in the form of 

E(x,a) = ||x-Da||^+^||a||^+rEL, (3) 

where p = {1, 2}, El is the locality constraint, A > and y > dictate the importance of || • ||^ and El, respectively. 
Then the key is to formulate the geometric property of the neighborhood into El. 

Naturally, El can be defined as the reconstruction error of the neighborhood of the testing image, i.e., 

EL(Y(x),C)=i 2 llyKx)-Dc,||^, (4) 

y.(x)eY(x) 
CieC 

where, for an input x e M^, its neighborhood Y(x) e M^^^ is searched from the training samples according to prior 
knowledge or manual labeling. For simplicity, we assume that each data point has K neighbors, and C/ € denotes 
the optimal code for y/(x). 

To bridge the connection between the objective variant a and C/, one assumes that a could be represented as a 
linear combination of the {c/}^j. Mathematically, 

c,eC 

where W/ 6 is the similarity between a and C/. The calculation of W/ is a challenging and key step which has been 
studied in many works [29, 31]. Most of these works get an optimal via an iterative calculation process which 
increases additional computational costs. Moreover, another diflference between our work and these works is that we 
aim to denote a with C/, not vice versa. 

Here, we present a simple but eff'ective method to solve this problem by directly replacing C/ with a. This is based 
on an assumption that the representation of x also can approximate the representation of y^, i.e., 

Ily-Dcdl^<||y,--Da||^<||y,--Da||^, 

where a denotes the representation of the point which is not close to y^. 

Figure 1 show an example to illustrate the validity of our motivation. There are three face images A, B and C 
selected from two diff'erent individuals, where A and B came from the same person. This means that A and B lie 
on the same subspace and can represent each other with less reconstruction error. Figure 1(b) is a practical example 
corresponding to Figure 1(a). Either from the Eigenface matrices or the coefficients of the two coding schemes, we 
can see that the similarity between A and B is much higher than the similarity between C and either of them. 

Based on the above discussions, we propose an objective function as follows: 

E(x,Y(x),a) =(l-r)l|x-Da||^ 

+ 2 \\yi-^^\\l^M^\\p^ (5) 

y.(x)eY(x) 
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Figure 1: The key observation is that the representations of nearby points are more similar than that of non-neighboring points, (a) Three face 
images from two different sub-manifolds are linked to their corresponding neighbors respectively, (b) The first column includes three images which 
correspond to the points A, B and C in Figure 1(a); The second column shows the Eigenface feature matrices for the test images; The third column 
includes two parts: the left part is them code coefficients for SRC, and the right one is for CRC-RLS. 



where ^ y ^ 1 balances the importance between the testing image x and its neighborhood Y(x). The second term, 
which measures the contribution of locaHty, can largely improve the robustness of a. If x is corrupted by noise or 
occluded by disguise, a larger y will yield better recognition results. In addition, we can find that SRC, CRC-RLS 
and ^^-FR are three simple and special forms of our algorithm when y = 0, /? = 1, and y = 0, = 2, as well as /I = 0, 
7 = 0, /? = 2, respectively. 

Consider the recent findings, i.e., ^^-norm based sparsity cannot bring higher recognition accuracy and better 
robustness for facial data than ^^-norm based methods [2, 3], we simplify our objective function (5) as follows: 

E(x,Y(x),a) =(l-r)l|x-Da||^ 



+ X lly/«-Da||^ + ^||a| 

y/(x)GY(x) 

Clearly, (6) achieves the minimum when its derivative with respect to a is zero. Hence, the optimal solution is 



2' (6) 

y/(x)GY(x) 



(D D + ^ 1)- D 



(i-r)x + r-^ ^ y,(x) 

y/(x)GY(x) 



(7) 



Let P = (D^D + A ' I)"^D^ whose calculation requires re-formulating the psuedo-inverse. Clearly, P can be 
calculated in advance and only once as it is only dependent on training data D, 

Given a testing image x, the first step is to determine its neighborhood Y(x) from the training set according to 
prior knowledge, or manual labeling, etc. In practical applications, there are two widely-used variations for finding 
the neighborhood: 

1. 6-neighborhoods. The training sample d/ is a neighbor of the testing image x if ||d/ - x||2 < 6, where 6 > is a 
constant. 

2. ^-nearest neighbors (^-NN). The training sample d/ is a neighbor of x, if d/ is among the ^-nearest neighbors 
of X, where ^ > can be specified as a constant or determined adaptively. 

The two variations involve a choice of distance metrics. A lot of baselines are available, for example, Euclidean 
distance. In addition, supervised distance metric learning [32], which makes use of label information to identify 
correlations between dimensions, has shown significantly improvement of the accuracy of ^-NN searching. 

Once the neighborhood of the testing image x is obtained, LCCR just simply projects x and its neighborhood Y(x) 
onto space P via (7). Thus, LCCR is quite fast. In addition, the matrix form of LCCR is easily derived, which can 
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used in batch processing. 



A* = (D D + ^ I)- D 



1 ^ 

(i-r)x + r-y YKX) 



where the columns of X g R^^-^ are the testing images whose codes are stored in A* e R^^-^, and Y/(X) e R^^-^ 
denotes the collection of /th-nearest neighbor of X. 

The proposed LCCR algorithm is summarized in Algorithm 1, and an overview is illustrated in Figure 2. 



Algorithm 1 Face Recognition using Locality-Constrained Collaborative Representation (LCCR) 

Input: A matrix of training samples D = [di,d2, . . . ,dA^] e R^^^ which are sorted according to the label of d/, 
I < i < N, A testing image x e R^; The balancing factors A>0 and ^ y ^ 1. 
1: Normalize the columns of D and x to have a unit ^^-norm, respectively. 
Calculate the projection matrix P = (D^D + /I • I)"^D^ and store it. 
Find the neighborhood Y(x)={yi(x), y2(x), . . ., y/r(x)} for x from training samples D. 
Code X over D via 



P 



(i-r)x + r-^ ^ y,(x) 



y,(x)eY(x) 



5: Compute the regularized residuals over all classes by 



nix) = 



||x-D*^,(a*)||2 
P/(a*)||2 ' 



where / denotes the index of class. 
Output: identity (x) = argmin^{r/(x)}. 



Input: X 




Figure 2: Overview of the coding process of LCCR, which consists of three steps separated by dotted lines. First, for a given input x, find its 
neighborhood Y(x) from training data. Then, code x over D by finding the optimal representation a (see bar graph) which produces the minimal 
reconstruction errors for x and Y(x) simultaneously. Finally, conduct classification by finding which class produces the minimum residual. In the 
middle part of the figure, we use a red rectangles to indicate the basis vectors which produce the minimum residual. 



4. Experimental Verification and Analysis 

In this section, we report the performance of LCCR over three publicly-accessed facial databases, i.e., AR [33], 
ORL [34] and the Extended Yale database B [35], of which the images are captured in constrained environments. 
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There are some challenging unconstrained face data for classification, e.g., FERET [36], FRGC [37]. However, as the 
recent works on facial data representation [1, 2, 3], we examine the performance comparison between LCCR and these 
three coding schemes with respect to 1) discrimination, 2) robustness to corruptions, 3) and robustness to occlusions. 

4.1. Experimental Configuration 

We compare the classification results of LCCR with three state-of-the-art coding algorithms, i.e., SRC [1], l^- 
FR [2] and CRC-RLS [3], as well as the result of linear SVM [38] on the original input. Note that, LCCR, SRC, ^^-FR 
and CRC-RLS directly code each testing sample over training data without usability of dictionary learning method, 
and get classification result by finding which subject produces minimum reconstruction error. In these four models, 
only LCCR incorporates locality based pairwise distance into coding scheme. For a comprehensive comparison, we 
report the performance of LCCR with five basic distance metrics, i.e., Euclidean distance (^^ -distance), Seuclidean 
distance (standardized Euclidean distance). Cosine distance (the cosine of the angle between two points), Cityblock 
distance (^^ -distance), and Spearman distance. 

We get the sparse code of SRC by using the CVX [39], a package for solving convex optimization problems, and 
the non- sparse representations of ^^-FR and CRC-RLS by using the source code from the homepages of corresponding 
authors. All experiments are carried out using Matlab 32bit on a 2. 5 GHz machine with 2.00 GB RAM. 




Neighborhood parameter: K Sparsity parameter: X Locality penalty coefficient: 7 



(a) (b) (c) 

Figure 3: Recognition accuracy of LCCR using Cityblock distance on a subset of AR database with dimensionality 2580. (a) The recognition rates 
versus the variation of the neighborhood parameter K, where A = 0.005 and y = 0.2. (b) The recognition rates versus the variation of the sparsity 
parameter A, where ^ = 5 and y = 0.2. (c) The recognition rates versus the variation of the locality constrained coefficient y, where K = 3 and 
A = 0.005. 

Parameter determination is a big challenge in pattern recognition and computer vision. As in [15, 28], we report 
the best classification results of all tested methods under diff'erent parameter configurations. The values tested for the 
parameters to find the best values for LCCR can be inferred from Figure 3, and these possible values of A also are 
tested for SRC and CRC-RLS. In all tests, we randomly split each database into two parts for training and testing, and 
compare the performance of the algorithms using the same partition to avoid the diff'erence of data sets. 

4.2. Recognition on Clean Images 

In this sub-section, we examine the performance of five competing methods using three clean facial data sets. Here, 
clean image means an image without occlusion or corruption, just with variations in illumination, pose, expression, 
etc. 

4.2.7. ORL database 

The ORL database consists of 400 diff'erent images of 40 individuals. For each person, there are 10 images with 
the variation of lighting, facial expression and facial details (with or without glasses). For computational efficiency, 
we cropped all ORL images from 1 12 x 96 to 56 x 48, and randomly selected 5 images from each subject for training 
and used the remaining 5 images for testing. Table 1 reports the classification accuracy of the tested algorithms over 
various dimensionality. Note that, the Eigenface with 200D retains 100% energy of cropped data, which makes the 
investigated methods achieve the same rates over the data space with 2688D. From the results, LCCRs outperform the 
other algorithms, and the best results are achieved when Cityblock distance is used to search the nearest neighbors. 
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Moreover, there is an interesting result that except ^^-FR, the other algorithms achieve a higher recognition rate with 
the increase of dimensionality. The possible reason is that the cropped operation play a negative role to ^^-FR, while 
Eigenface increases the discrimination of model. In the following experiments, we will observe the same phenomenon. 



Table 1 : The Maximal Recognition Accuracy of Competing Algorithms on the ORL Database. 



Dim 


54 


120 


200 


2688 


SVM [38] 


90.00% 


92.50% 


93.50% 


93.00% 


SRC [1] 


92.00% 


96.50% 


86.00% 




i^-FR [2] 


92.50% 


91.00% 


89.00% 


89.00% 


CRC-RLS [3] 


94.50% 


94.00% 


94.50% 


95.00% 


LCCR + Cityblock 


97.50% 


97.50% 


98.00%. 


98.00% 


LCCR + SeucHdean 


96.00% 


96.50% 


96.00% 


96.50% 


LCCR + Euclidean 


96.00% 


96.00% 


96.50% 


96.50% 


LCCR + Cosine 


96.00% 


96.50% 


96.50% 


96.50% 


LCCR + Spearman 


96.00% 


96.00% 


96.00% 


96.00% 



4.2.2. AR database 

The AR database includes over 4000 face images of 126 people (70 male and 56 female) which vary in expression, 
illumination and disguise (wearing sunglasses or scarves). Each subject has 26 images consisting of 14 clean images, 
6 images with sunglasses and 6 images with scarves. As in [1, 3], a subset that contains 1400 normal faces randomly 
selected from 50 male subjects and 50 female subjects, is used in our experiment. For each subject, we randomly 
permute the 14 images and take the first half for training and the rest for testing. Limited by the computational 
capabilities, as in [3], we crop all images from original 165 x 120 to 60 x 43 (25 SOD) and convert it to gray scale. 

SRC [1] and CRC-RLS [3] achieved the classification result by extracting eigenface (PC A) with 54D, 120D and 
300D from cropped data, which means the dictionary D is an under-determined matrix. ^^-FR [2] reported the results 
for the original data, which means D is an over-determined matrix. For a extensive comparison, we investigate the 
performance of the tested methods over two cases except SRC on the latter one, since SRC requires D to be an 
under-determined matrix. 

From Table 2, we can see that LCCRs are superior to SVM, SRC, ^^-FR and CRC-RLS with respect to all 
examined dimensionality, and SRC achieves the second best result. 

Table 2: The Maximal Recognition Accuracy of Competing Algorithms on the AR Database. 



Dim 


54 


120 


300 


2580 


SVM [38] 


73.43% 


81.00% 


82.00% 


83.14% 


SRC [1] 


81.71% 


88.71% 


90.29% 




^2-FR [2] 


80.57% 


90.14% 


93.57% 


82.29% 


CRC-RLS [3] 


80.57% 


90.43% 


94.00% 


94.43% 


LCCR + Cityblock 


86.14% 


92.71% 


95.14% 


95.86% 


LCCR + Seuclidean 


85.00% 


91.86% 


94.43% 


95.43% 


LCCR + Euclidean 


84.00% 


91.29% 


94.14% 


94.86% 


LCCR + Cosine 


83.43% 


90.86% 


94.00% 


94.57% 


LCCR + Spearman 


84.71% 


90.71% 


94.14% 


94.43% 



4.2.3. Extended Yale B database 

The database contains 2414 frontal-face images with size 192 x 168 over 38 subjects, as in [1, 3], we conduct 
experiments on the cropped and normalized images of size 54 x 48. For each subject (about 64 images per subject), 
we randomly split the images into two parts with equal size, one for training, and the other for testing. Similar to the 
above experimental configuration, we calculate the recognition rates over the Eigenface spaces with dimensionality 
54, 120 and 300, and full dimensional space with 2592D. From Table 3, it is clear that LCCRs again outperform its 
counterparts across various spaces, especially when the Spearman distance is used to determine the neighborhood of 
testing samples. 

Based on the results on ORL, AR, and Extended Yale B, we draw following conclusions: 
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Table 3: The Maximal Recognition Accuracy of Competing Algorithms on the Extended Yale B Database. 



Dim 


54 


120 


300 


2592 


SVM [38] 


84.52% 


92.72% 


95.28% 


95.45% 


SRC [1] 


93.71% 


95.12% 


96.44% 




^2-FR [2] 


92.88% 


95.61% 


97.85% 


90.48% 


CRC-RLS [3] 


92.96% 


95.69% 


97.90% 


98.26% 


LCCR + Cityblock 


93.21% 


96.03% 


97.93% 


98.34% 


LCCR + SeucHdean 


93.21% 


95.70% 


97.93% 


98.34% 


LCCR + EucHdean 


93.21% 


95.70% 


97.93% 


98.34% 


LCCR + Cosine 


93.46% 


95.78% 


97.93% 


98.59% 


LCCR + Spearman 


97.02% 


98.18% 


99.10% 


99.59% 



1. For all databases used, LCCRs outperforms SVM (original input), SRC (sparse code), ^^-FR (non-sparse rep- 
resentation) and CRC-RLS (collaborative representation) over the all tested dimensionalities. 

2. LCCR perform better for low-dimensional data than high-dimensional ones. For example, on the Extended Yale 
B, the maximal rate difference between LCCR and CRC-RLS (the second best method) changed from 4.06% 
with 54D, to 2.49% with 120D, and to 1.17% with 300D. It again corroborates our claim that locality based 
on pairwise is helpful to improving the discrimination of data representation, since the low-dimensional data 
contain few information than higher one. 

3. SRC outperform ^^-FR and CRC-RLS over the low-dimensional feature spaces, which is consistent with the 
reports in [3]. CRC-RLS and ^^-FR achieve similar recognition rates with the difference less than 1% across 
various feature spaces. 



4.3. Recognition on Partial Facial Features 



u 



(a) 



Features 


Right Eye 


Mouth and Chin 


Nose 


Dim 


308 


798 


224 


SVM [38] 


70.71% 


41.29% 


37.14% 


SRC [1] 


84.00% 


70.71% 


78.00% 


[2] 


72.86% 


32.86% 


70.57% 


CRC-RLS [3] 


83.14% 


73.86% 


73.57% 


LCCR + Cityblock 


86.86% 


76.29% 


75.86% 


LCCR + Seuclidean 


84.86% 


75.57% 


75.29% 


LCCR + Euclidean 


85.29% 


75.00% 


76.00% 


LCCR + Cosine 


84.43% 


74.57% 


75.29% 


LCCR + Spearman 


84.57% 


75.86% 


75.14% 



(b) 



Figure 4: Recognition Accuracy with partial face features, (a) An example of the three features, right eye, mouth and chin, and nose from left to 
right, (b) The recognition rates of competing methods on the partial face features of the AR database. 



The ability to work on partial face features is very interesting since not all facial features play an equal role in 
recognition. Therefore, this ability has become an important metric for the recognition algorithm [40]. We examine 
the performance of the tested methods on three partial facial features, i.e., right eye, nose, as well as mouth and chin, 
sheared from the clean AR faces with 25 SOD (as shown in Figure 4(a)). For each partial face feature, we generate 
a data set by randomly selecting 7 images per subject for training and the remaining 700 for testing. It should be 
noted that [1] conducted similar experiment on Extended Yale B which includes less subjects, smaller irrelevant white 
background, and more training samples per subject than our case. 

Figure 4(b) shows that the proposed LCCRs achieve better recognition rates than SVM, SRC, ^^-FR and CRC- 
RLS for right eye as well as mouth and chin, and the second best rates for the nose. Some works found that the most 
important feature is the eye, followed by the mouth, and then the nose [41]. We can see that the results for SVM, 
CRC-RLS and LCCR are consistent with these studies although the dominance of the mouth and chin over the nose 
is not very distinct. 
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4.4. Face Recognition with Block Occlusions 




(a) (b) (c) 



Dim 


300 (Eigenface) 


2580 


Occlusion ratio 


10% 


20% 


30% 


40% 


50% 


10% 


20% 


30% 


40% 


50% 


LCCR + Cityblock 


90.71% 


84.43% 


74.14% 


61.43% 


51.43% 


91.57% 


85.57% 


76.43% 


62.29% 


52.29% 


LCCR + Seuclidean 


90.00% 


80.14% 


67.86% 


53.71% 


39.57% 


91.00% 


82.14% 


70.57% 


56.29% 


41.71% 


LCCR + Euclidean 


88.86% 


80.86% 


68.14% 


55.00% 


43.29% 


89.71% 


81.86% 


71.00% 


57.00% 


43.86% 


LCCR + Cosine 


88.57% 


80.29% 


69.29% 


56.57% 


44.00% 


89.00% 


82.43% 


71.00% 


57.29% 


45.57% 


LCCR + Spearman 


88.29% 


81.00% 


70.00% 


57.14% 


43.29% 


89.57% 


83.00% 


72.57% 


59.43% 


44.14% 



(d) 



Figure 5: Experiments on AR database with varying percent block occlusion, (a) From top to bottom, the occlusion percents for test images 
are, 10%, 30%, and 50%, respectively, (b) and (c) are the recognition rates under different levels of block occlusion on AR database with 300D 
(Eigenface) and 2580D, respectively, (d) The recognition rates of LCCRs which outperform the other tested models. 

To examine the robustness to block occlusion, similar to [1, 2, 3], we get 700 testing images by replacing a random 
block of each clean AR image with an irrelevant image (baboon) and use 700 clean images for training. The occlusion 
ratio increases from 10 percent to 50 percent, as shown in Figure 5(a). We investigate the classification accuracy of 
all tested methods over Eigenface space with 300D (Figure 5(b)) and cropped data space with 25 SOD (Figure 5(c)). 

Figure 5(b) and Figure 5(c) show that LCCRs again outperform the other models with considerable performance 
margins. Especially, with the increase of the occlusion ratio, the diff'erence of recognition rates between LCCRs and 
the other methods becomes larger. For example, when the occlusion ratio is 50%, in 300 dimensional space, the 
accuracy of LCCR with Cityblock distance is about 20% higher than SVM, about 26% higher than SRC, about 20% 
higher than ^^-FR, and about 15% higher than CRC-RLS. 

On the other hand, it is easy to find that ^^-FR, CRC-RLS and LCCRs are more robust than SRC and SVM, which 
implies that the ^^-regularization term cannot yield better robustness than the ^^-regularization term, at least for the 
Eigenface space. Moreover, all models achieve better results in higher dimensional space, although the diff'erence of 
classification accuracy between higher dimensional space and lower ones is less than 1%. 

4.5. Face Recognition with Real Occlusions 

In this sub-section, we examine the robustness to real possible occlusions of competing methods by using the AR 
database distributed over 100 individuals. We use 1400 clean images for training, 600 faces wearing by sunglasses 
(occluded ratio is about 20%) and 600 face wearing by scarves (occluded ratio is about 40%) for testing, separately. In 
[1], Wright et al. only used a third of disguised images for this test, i.e., about 200 images for each kind of disguise. In 
addition, this sub-section investigate the relationship between the eff'ectiveness of ^-NN searching and that of LCCR. 

We examine two widely-used feature schemes [1, 3, 31], namely, the holistic feature with 54D, 300D and 2580D, 
as well as the partitioned feature based on the cropped data with 25 SOD. The partitioned feature scheme firstly parti- 
tions an image into multiple blocks (8 blocks in our case as did in [1, 3, 31], see Figure 6(a) and 6(c)), then conducts 
classification on each block independently, and after that, aggregates the results by voting. 

Figure 6(e) reports the recognition rates of all tested methods. For the images occluded by sunglasses, LCCR with 
Cityblock distance achieves remarkable results with the holistic feature scheme, its recognition accuracy are nearly 
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(a) (b) (c) (d) 



Disguise 


sunglasses 


scarves 


Feature 


Holistic 


Partitioned 


Holistic 


Partitioned 


Dim 


54 


300 


2580 


2580 


54 


300 


2580 


2580 


SVM [38] 


45.50% 


47.83% 


48.67% 


40.17% 


11.33% 


13.50% 


13.83% 


41.67% 


SRC [1] 


36.67% 


57.00% 




93.00% 


26.67% 


69.83% 




91.83% 


[2] 


32.83% 


52.83% 


49.17% 


88.17% 


21.50% 


68.50% 


57.50% 


91.83% 


CRC-RLS [3] 


42.17% 


53.00% 


71.50% 


88.33% 


21.50% 


68.50% 


89.17% 


92.17% 


LCCR + Cityblock 


85.33% 


93.00% 


93.50% 


91.17% 


22.17% 


68.67% 


89.17% 


92.50% 


LCCR + Seuclidean 


42.83% 


57.83% 


74.50% 


91.50% 


22.00% 


68.50% 


89.17% 


92.17% 


LCCR + Euclidean 


47.33% 


66.00% 


79.00% 


90.83% 


22.17% 


68.67% 


89.17% 


92.50% 


LCCR + Cosine 


65.50% 


76.83% 


85.33% 


93.83% 


21.17% 


68.83% 


89.50% 


93.83% 


LCCR + Spearman 


58.67% 


70.67% 


81.17% 


95.83% 


21.67% 


68.67% 


89.33% 


93.83% 



(e) 



Figure 6: Recognition on AR faces with real possible occlusions, (a) The top row is a facial image occluded by sunglass, whose partitioned blocks 
are shown as below, (b) The accuracy of K-NN searching using Cityblock distance, Cosine distance, Euclidean distance, Seuclidean distance and 
Spearman distance on the AR images with sunglasses (2580D). (c) Similar to (a), the top row is a face occluded by scarf, and its partitions below, 
(d) The precision of K-NN searching using Cityblock, Cosine, Euclidean, Seuclidean, Spearman as distance metrics on the AR images with scarves 
(2580D). (e) The recognition rates of competing methods across different experimental configurations. 

double that of the other methods. This considerable performance margin contributes to that the locaHty based on 
Cityblock distance and largely improves the discrimination of representation when traditional coding scheme cannot 
work well (see Figure 6(b)). For the images occluded by scarves, LCCR achieves the highest recognition rate over the 
full dimensional space, and the second highest rates using Eigenface. However, the difference in rates between LCCR 
and other non-iterative algorithms (^^-FR, CRC-RLS) is very small due to the poor accuracy of ^-NN searching as 
shown in Figure 6(d). Furthermore, the partitioned feature provides higher recognition rates than the holistic one for 
all competing methods, which is consistent with previous report [1]. 

From the above experiments, it is easy to conclude that the preservation of locality is helpful to coding scheme, 
especially when the real structures of data cannot be found by traditional coding scheme. Moreover, the performance 
ranking of LCCR with five distance metrics is same with that of ^-NN searching in terms of these metrics. 

4.6. Face Recognition with Corruption 

We test the robustness of LCCR against two kinds of corruption using the AR data set containing 2600 images 
of 100 individuals. For each subject, we use 13 images for training (7 clean images, 3 images with sunglasses, and 
3 images with scarves), and the remaining 13 images for testing. Diff'erent from [1] which tested the robustness to 
corruption using the Extended Yale B database, our case is more challenging for the following reasons. Firstly, AR 
images contain real possible occlusions, i.e., sunglasses and scarves, while Extended Yale B is a set of clean images 
without disguises. Secondly, AR includes more facial variations (13 versus 9), more subjects (100 versus 38), and 
a smaller samples for each subject (26 images per subject versus 64 images per subject). Thirdly, we investigated 
two kinds of corruption, white noise (additive noise) and random pixel corruption (non-additive noise) which are two 
commonly assumed in face recognition problem [1, 2]. For the white noise case (the top row of Figure 7), we add 
random noise from normal distribution to each testing image x, that is, x = x -I- an, and restrict x g [0 255], where 
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a is the corruption ratio from 10% to 100% with an interval of 10%, and n is the noise following a standard normal 
distribution. For the random pixel corruption case (the bottom row in Figure 7), we replace the value of a percentage 
of pixels randomly chosen from each test image with the values following a uniform distribution over [0 Pmax], where 
Pmax is the largest pixel value of current image. 



1 










^^^^^^^ 











Figure 7: Testing images from AR database with corruption. Top row: 10%, 30%, 50%, 70%, 90% white noises are added into test image; Bottom 
row, 20%, 40%, 60%, 80%, 100% of pixels are randomly corrupted. 



Table 4: The robustness to white gaussian noise using Eigenface (300D) on the AR database (coding over D). 



Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


SVM [38] 


91.77% 


91.46% 


91.38% 


91.15% 


90.23% 


89.85% 


88.62% 


86.08% 


82.69% 


80.08% 


SRC [1] 


92.62% 


91.54% 


91.23% 


89.23% 


86.54% 


82.54% 


78.31% 


72.08% 


62.62% 


56.46% 


^2-FR [2] 


93.39% 


92.92% 


92.39% 


91.23% 


88.85% 


85.23% 


81.85% 


75.54% 


67.62% 


62.23% 


CRC-RLS [3] 


94.77% 


94.46% 


94.39% 


94.15% 


92.85% 


91.77% 


90.92% 


89.23% 


87.31% 


84.92% 


LCCR + Cityblock 


97.00% 


96.08% 


96.00% 


95.62% 


94.54% 


94.15% 


92.31% 


90.85% 


89.08% 


87.39% 


LCCR + Seuclidean 


96.31% 


95.77% 


95.77% 


95.54% 


94.46% 


94.39% 


92.39% 


90.85% 


88.54% 


86.69% 


LCCR + Euclidean 


95.77% 


95.46% 


95.23% 


95.15% 


94.23% 


94.08% 


92.31% 


90.77% 


88.31% 


86.54% 


LCCR + Cosine 


95.62% 


95.15% 


95.15% 


95.08% 


93.92% 


93.85% 


92.15% 


90.31% 


88.69% 


86.92% 


LCCR + Spearman 


96.15% 


95.46% 


95.39% 


95.15% 


94.69% 


94.23% 


93.08% 


92.15% 


89.77% 


89.00% 


Table 5: The robustness to white 


gaussian noises using Eigenface (300D) on the AR database (coding over [D I]). 




Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


SRC [1] 


92.62% 


91.77% 


92.23% 


91.08% 


90.46% 


89.39% 


87.92% 


84.15% 


84.08% 


78.92% 


^2-FR [2] 


92.62% 


92.23% 


92.15% 


91.92% 


91.00% 


90.62% 


88.85% 


87.31% 


86.31% 


82.15% 


CRC-RLS [3] 


92.62% 


92.23% 


92.15% 


91.92% 


91.00% 


90.69% 


88.85% 


87.31% 


86.31% 


82.15% 


LCCR + Cityblock 


93.08% 


92.92% 


92.85% 


92.39% 


91.69% 


91.15% 


89.62% 


88.08% 


86.62% 


84.23% 


LCCR + SeucHdean 


92.92% 


92.85% 


92.62% 


92.39% 


91.69% 


91.23% 


89.62% 


88.15% 


86.46% 


83.62% 


LCCR + Euclidean 


92.85% 


92.77% 


92.54% 


92.39% 


91.69% 


91.39% 


89.92% 


88.31% 


86.62% 


83.46% 


LCCR + Cosine 


92.77% 


92.69% 


92.69% 


92.62% 


91.92% 


91.69% 


89.92% 


88.39% 


87.00% 


83.85% 


LCCR + Spearman 


93.00% 


92.85% 


92.85% 


92.85% 


92.15% 


91.62% 


90.15% 


89.54% 


87.54% 


86.39% 


Table 6: The robustness to white gaussian noises using the AR database (2580D) (coding over D). 


Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


SVM [38] 


91.92% 


91.92% 


91.69% 


91.31% 


90.00% 


89.85% 


88.62% 


85.62% 


82.92% 


79.85% 


^2-FR [2] 


78.69% 


55.23% 


33.77% 


17.00% 


4.62% 


3.38% 


4.62% 


3.38% 


2.69% 


2.31% 


CRC-RLS [3] 


94.92% 


94.85% 


94.77% 


93.92% 


93.23% 


92.23% 


90.85% 


89.31% 


88.39% 


85.39% 


LCCR + Cityblock 


97.54% 


96.54% 


96.08% 


95.85% 


95.08% 


94.92% 


93.15% 


91.54% 


90.54% 


86.77% 


LCCR + SeucHdean 


96.92% 


96.23% 


96.08% 


95.92% 


95.39% 


94.62% 


92.92% 


91.15% 


89.00% 


87.00% 


LCCR + Euclidean 


96.08% 


95.62% 


95.46% 


95.39% 


94.85% 


94.23% 


92.23% 


91.31% 


88.92% 


87.08% 


LCCR + Cosine 


96.08% 


95.62% 


95.46% 


95.31% 


94.39% 


94.31% 


92.54% 


90.62% 


89.46% 


87.23% 


LCCR -1- Spearman 


96.54% 


95.77% 


95.62% 


95.23% 


94.69% 


94.54% 


93.31% 


92.23% 


90.39% 


89.39% 



To improve the anti-noise ability of SRC [1], Wright et al. generate a new dictionary [D I] by concatenating 
an identity matrix I with the original dictionary D, where the dimensionality of I equals to that of data. The use 
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Table 7: The robustness to white gaussian noises using the AR database (25 SOD) (coding over [D I]). 



Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


^^-FR [2] 


93.00% 


93.00% 


92.46% 


92.77% 


91.69% 


90.92% 


90.15% 


88.15% 


87.15% 


83.08% 


CRC-RLS [3] 


93.00% 


93.00% 


92.46% 


92.77% 


91.69% 


90.92% 


89.23% 


88.15% 


87.15% 


83.08% 


LCCR + Cityblock 


93.39% 


93.31% 


93.31% 


93.00% 


92.15% 


91.85% 


90.15% 


88.77% 


87.39% 


85.31% 


LCCR + Seuclidean 


93.39% 


93.00% 


93.00% 


92.40% 


92.15% 


91.69% 


90.15% 


88.77% 


87.31% 


84.39% 


LCCR + Euclidean 


93.31% 


93.08% 


93.23% 


92.77% 


91.92% 


91.77% 


90.31% 


88.85% 


87.62% 


84.46% 


LCCR + Cosine 


93.15% 


93.08% 


93.08% 


93.00% 


92.23% 


92.08% 


90.39% 


89.39% 


87.77% 


84.46% 


LCCR + Spearman 


93.54% 


93.39% 


93.39% 


93.15% 


92.62% 


92.15% 


90.54% 


90.23% 


88.31% 


87.08% 


Table 8: The robustness to random pixel corruption using Eigenface (300D) on the AR database (coding 


over D). 




Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


SVM [38] 


91.54% 


88.00% 


81.92% 


66.54% 


45.46% 


23.00% 


8.23% 


3.69% 


2.00% 


1.15% 


SRC [1] 


89.62% 


84.31% 


72.31% 


55.15% 


38.85% 


20.31% 


8.23% 


4.62% 


2.00% 


1.00% 


^2-FR [2] 


91.77% 


86.62% 


77.00% 


62.08% 


45.77% 


28.46% 


13.62% 


4.77% 


2.54% 


1.54% 


CRC-RLS [3] 


84.08% 


91.62% 


88.69% 


80.23% 


65.46% 


44.77% 


20.77% 


6.69% 


2.92% 


1.62% 


LCCR + Cityblock 


96.54% 


95.23% 


92.31% 


87.77% 


79.69% 


62.31% 


37.08% 


15.39% 


5.23% 


1.62% 


LCCR + Seuclidean 


95.69% 


93.39% 


90.08% 


81.92% 


65.85% 


44.85% 


20.77% 


1.46% 


3.00% 


1.69% 


LCCR + EucHdean 


95.39% 


92.85% 


90.39% 


82.85% 


67.23% 


45.08% 


20.92% 


6.69% 


3.23% 


1.62% 


LCCR + Cosine 


94.85% 


92.46% 


89.46% 


81.39% 


65.62% 


45.08% 


20.85% 


6.77% 


3.69% 


2.08% 


LCCR + Spearman 


95.54% 


94.08% 


92.54% 


88.62% 


83.00% 


74.15% 


59.31% 


38.00% 


13.69% 


1.92% 



Table 9: The robustness to random pixel corruption using Eigenface (300D) on the AR database (coding over [D I]). 



Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


SRC [1] 


91.62% 


88.85% 


83.38% 


73.92% 


56.31% 


33.08% 


14.92% 


4.92% 


2.08% 


1.31% 


^2-FR [2] 


91.15% 


89.46% 


84.46% 


71.31% 


49.92% 


25.69% 


8.31% 


2.92% 


1.85% 


1.15% 


CRC-RLS [3] 


91.15% 


89.46% 


84.46% 


71.31% 


49.92% 


25.69% 


8.31% 


2.92% 


1.85% 


1.15% 


LCCR + Cityblock 


92.62% 


90.92% 


87.54% 


81.15% 


69.46% 


53.08% 


34.00% 


14.62% 


4.92% 


1.38% 


LCCR + SeucHdean 


91.85% 


89.77% 


85.23% 


72.39% 


50.00% 


25.92% 


8.31% 


3.31% 


2.46% 


1.31% 


LCCR + EucHdean 


91.92% 


89.85% 


85.15% 


73.23% 


51.23% 


27.62% 


10.23% 


4.00% 


2.54% 


1.31% 


LCCR + Cosine 


92.15% 


90.00% 


84.92% 


71.46% 


49.92% 


25.69% 


8.38% 


5.31% 


2.62% 


1.23% 


LCCR + Spearman 


92.69% 


90.85% 


87.85% 


83.08% 


75.92% 


66.69% 


56.54% 


37.85% 


1.38% 


1.77% 



Table 10: The robustness to random pixel corruption using the AR database (2580D) (coding over D). 



Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


SVM [38] 


91.69% 


88.38% 


81.46% 


65.62% 


44.77% 


22.23% 


7.69% 


3.69% 


1.92% 


1.23% 


^2-FR [2] 


21.77% 


8.31% 


3.77% 


3.23% 


2.39% 


1.92% 


0.92% 


1.23% 


1.15% 


1.00% 


CRC-RLS [3] 


94.15% 


92.00% 


89.08% 


82.00% 


67.08% 


47.08% 


22.69% 


7.00% 


2.62% 


1.23% 


LCCR + Cityblock 


96.85% 


95.62% 


93.23% 


88.62% 


78.77% 


57.54% 


29.77% 


8.08% 


4.54% 


1.62% 


LCCR + SeucHdean 


96.00% 


94.23% 


90.77% 


82.85% 


67.31% 


47.08% 


22.69% 


7.15% 


3.00% 


1.54% 


LCCR + EucHdean 


95.62% 


93.62% 


91.15% 


83.77% 


68.31% 


47.39% 


22.92% 


7.15% 


3.00% 


1.54% 


LCCR + Cosine 


95.31% 


93.31% 


90.77% 


82.62% 


67.15% 


47.15% 


23.23% 


7.23% 


3.62% 


1.77% 


LCCR -1- Spearman 


95.85% 


94.77% 


92.92% 


89.23% 


83.31% 


74.77% 


60.69% 


38.00% 


13.85% 


1.85% 


Table 11: The robustness to random pixel corruption using the AR database (2580D) (coding over [D I]). 


Corruption ratio 


10% 


20% 


30% 


40% 


50% 


60% 


70% 


80% 


90% 


100% 


^^-FR [2] 


92.15% 


89.85% 


84.92% 


73.15% 


51.15% 


26.15% 


8.54% 


2.92% 


1.77% 


1.15% 


CRC-RLS [3] 


92.15% 


89.85% 


84.92% 


73.15% 


51.15% 


26.15% 


8.54% 


3.00% 


1.77% 


1.15% 


LCCR + Cityblock 


93.08% 


91.39% 


88.23% 


81.92% 


70.15% 


53.54% 


34.08% 


14.62% 


5.08% 


1.38% 


LCCR + Seuclidean 


92.69% 


90.39% 


85.85% 


73.85% 


51.39% 


26.15% 


8.62% 


3.31% 


2.46% 


1.31% 


LCCR + EucHdean 


92.62% 


90.39% 


86.00% 


74.46% 


52.46% 


28.54% 


10.69% 


4.08% 


2.54% 


1.38% 


LCCR + Cosine 


92.69% 


90.54% 


85.62% 


73.15% 


51.59% 


26.15% 


8.62% 


6.08% 


2.77% 


1.31% 


LCCR -1- Spearman 


93.31% 


91.54% 


88.54% 


84.31% 


77.00% 


67.00% 


56.54% 


37.85% 


13.62% 


1.62% 



of I has been verified to be eff'ective in improving the robustness of ^ -norm based models [13, 42], while it will 
increase the computational costs. Therefore, this is a tradeoff" between robustness and computational efficiency for 
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these algorithms. Will the strategy still work for ^^-minimization based models? In this sub- section, we fill this gap 
by comparing the results by coding over these two dictionary. 

Table 4 through Table 1 1 are the recognition rates of the tested methods across feature space (Eigenface with 
300D) and full dimensional space (2580D). The results of SVM with the strategy of expanding dictionary are not 
reported as SVM is directly conducted on the input without encoding. Based on the results, we have the following 
conclusions: 

Firstly, the proposed LCCRs are much superior to SVM, SRC, ^^-FR and CRC-RLS with considerable margins 
in various tests. For example, in the worst case (the white noise corruption ratio is 100%, Table 4-7), the best result 
of LCCR, which is achieved with Spearman distance, is about 89.39% (Table 6), compared to 80.08% for SVM 
(Table 4), 78.92% for SRC (Table 5), 83.05% for ^^-FR (Table 5), and 85.39% for CRC-RLS (Table 6). In the case of 
random pixel corruption (Table 8-11), one can see when the corruption ratio reaches 70%, all methods fail to perform 
recognition except LCCR whose best result is about 60.69% (Table 10), compared to 8.23% for SVM (Table 8), 
14.92% for SRC (Table 9), 13.62% for ^^.pR (Table 8), and 22.69% for CRC-RLS (Table 10). 

Secondly, all investigated algorithms perform worse with increased corruption ratio, and achieve better results 
in white noise corruption (additive noise) than random pixel corruption (non-additive noise). Moreover, all models 
perform slightly better (less than 1%) over full-dimensional space excepted ^^-FR. 

Thirdly, the results show that coding over [D I] is helpful in improving the robustness of SRC and ^^-FR, but it 
has negative impact on the recognition accuracy of CRC-RLS and LCCR. For example, when white noise ratio rises 
to 100% for the Eigenface (Table 4 and 5), expanding D leads to the variation of the recognition rate from 56.46% 
to 78.92% for SRC, from 62.23% to 82.15% for ^^.pR, from 84.92% to 82.12% for CRC-RLS, and from 89.00% to 
86.39% for LCCR with Spearman distance. This result has not been reported in previous works. 

5. Conclusions and Discussions 

It is interesting and important to improve the discrimination and robustness of data representation. The traditional 
coding algorithm gets the representation by coding each datum as a linear combination of a set of training set, which 
depicts the global structure and property of data. However, it will be failed when the data are corrupted. Locality 
preservation has shown the effectiveness in revealing the structure of data, which could keep the geometric structure 
of manifold. In this paper, we proposed a novel objective function to get an effective and robust representation by 
incorporating the locality into the coding scheme, and the function possesses analytic solution. Moreover, two recent 
proposed models, ^^-FR and CRC-RLS, could be regarded as two special versions of our model without locality 
constraint. The experimental studies showed that the introduction of locality makes LCCR more accurate and robust 
to various occlusions and corruptions. We analyze the classification results of LCCR with five basic distance metrics 
(for locality). The results imply that if better ^-NN searching methods or more sophisticated distance metrics are 
adopted, such as, the supervised distance metric learning [43], LCCR will achieve higher recognition result and better 
robustness. Moreover, the performance comparisons over two different dictionaries show that it is unnecessary to 
expand the dictionary D with I for ^^-norm based coding algorithms. 

Each approach has its own advantages and disadvantages. Parameter determination maybe is the biggest problem 
of LCCR which requires the specification of three parameters, locality parameter K and two constraint parameters A 
and y. In the future works, we will focus on some of these issues. For example, it is possible that the value of K can 
refer to the intrinsic dimensionality of subspace. Moreover, dictionary learning plays an important role in information 
representation. Hence, it is valuable to explore how to exploit low-dimensional geometric models to build an effective 
dictionary. 
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